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SYSTEM AND METHOD FOR EXPRESSING AND CALCULATING A 

RELATIONSHIP BETWEEN MEASURES 

FIELD OF THE INVENTION 

[0001] The present invention relates to the field of data analysis services, and, 

more specifically, to expressing and calculating a relationship among measures of data. 

BACKGROUND OF THE INVENTION 

[0002J Online analytical processing (OLAP) is a key part of many data warehouse 

and business analysis systems. OLAP services provide for fast analysis of 
multidimensional information. For this purpose, OLAP services provide for 
multidimensional access and navigation of data in an intuitive and natural way, providing 
a global view of data that can be drilled down into particular data of interest. Speed and 
response time are important attributes of OLAP services that allow users to browse and 
analyze data online in an efficient manner. Further, OLAP services typically provide 
analytical tools to rank, aggregate, and calculate lead and lag indicators for the data under 
analysis. 

[0003J In this context, an OLAP cube may be modeled according to a user's 

perception of the data. A cube may organize a data type according to dimensions, each 
dimension modeled according to an attribute of the data type. For example, a cube may 
organize "Sales" data according to the dimensions "Customer", "Location" and "Time." 
Typically, there is a hierarchy associated with each dimension. The hierarchy includes 
levels of granularity. For example, the time dimension can consist of years subdivided 
into quarters subdivided into months subdivided into weeks subdivided into days. The 
years level is the broadest level of granularity, while the days level is the finest level of 
granularity. Dimension members act as indices for identifying a particular cell or range of 
cells within the cube. The cube may also have measures, which measure a data type 
according to its attributes. For example, the "Sales" cube may have a measure "Sales 
Amount", which is the amount of sales made in source currencies of a corresponding 
location. 

[0004] OLAP services are often used to analytically model data that is stored in a 

relational database such as, for example, an Online Transactional Processing (OLTP) 
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database. Data stored in such a relational database may be organized according to 
multiple tables. Each such table may organize a data type according to columns 
corresponding to attributes and measures. For example, the "Sales" cube may be modeled 
according to a "Sales" table with columns corresponding to attributes "Customer", 
"Location", and "Time" and measure "Sales Amount". Furthermore, an "Exchange" table 
may include columns corresponding to attributes "Location" and "Time" and measure 
"Exchange Rate", with "Exchange Rate" being the rate of conversion of a source currency 
of a corresponding location into US currency. 

[0005] It is often desirable to calculate relationships between different measures. 

For example, it may be desirable to calculate the product of the "Sales Amount" measure 
from the "Sales" table and the "Exchange Rate" measure of the "Exchange" table to 
convert sales from a source currency into US currency. Such a relationship between 
measures is first calculated at the finest level of granularity for which data is available, and 
then the calculated results are subsequently aggregated to produce results for a broader 
selected range. For example, if the finest level of granularity for both "Sales" data and the 
"Exchange" data is the month level, then a request to convert Mexican and Canadian 
currency sales into US currency sales for a quarter is calculated for each of the two source 
currencies in each of the three months of the quarter, resulting in six separate calculations. 
The six calculations are subsequently aggregated to provide total Mexican and Canadian 
sales in US currency for the entire quarter. 

[0006] In conventional OLAP systems, data for each measure must be separately 

accessed for each calculation from each table on a row by row basis. For example, to 
convert Mexican and Canadian sales for the quarter, the "Sales" table and the "Exchange" 
table must each be separately accessed six times for each of the six rows involved in the 
calculation. Such separate access on a row by row basis may significantly prolong the 
amount of time required to perform complex calculations. Thus, there is a need in the art 
for systems and methods for calculating a relationship between measures in which the 
relationship is not calculated on a row by row basis. 

SUMMARY OF THE INVENTION 

[0007] A measure expression may include a relationship between a plurality of 

measures defined by an arithmetic operation. Each measure within the expression may 
correspond to an associated data type. Data for each data type may be stored in an 
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associated data table. A request may be received to calculate the measure expression over 
a selected range of attributes. 

[0008] In an embodiment of the invention, the request is processed by retrieving a 

plurality of caches each corresponding to an associated measure in the expression. Each 
such cache may include all rows comprising data for the associated measure over the 
selected range of attributes. All such rows may be retrieved through a single access to the 
associated data table. 

[0009] A plurality of indices may be generated each corresponding to an 

associated cache. Each such index may include data for the associated measure over the 
selected range of attributes common to every measure in the expression. Such data may 
be aggregated by attributes specific to the associated data type. 

[0010] The arithmetic operation in the measure expression may be performed on 

the data from each index on a row by row basis to achieve resulting data, and the resulting 
data may be aggregated over the selected range of attributes common to every measure in 
the expreOssion. The aggregated resulting data may provide the total measure expression 
over the selected range of attributes. 

[0011] Additional features and advantages of the invention will be made apparent 

from the following detailed description of illustrative embodiments that proceeds with 
reference to the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0012] The illustrative embodiments will be better understood after reading the 

following detailed description with reference to the appended drawings, in which: 
[0013] Fig. 1 is a block diagram representing a general purpose computer system 

in which aspects of the present invention and/or portions thereof may be incorporated; 
[0014] Fig. 2 is a block diagram of an exemplary system for analytically modeling 

data in accordance with the present invention; 

[0015] Fig. 3 is a sample relational database table corresponding to "Sales" data in 

accordance with the present invention; 

[0016] Fig. 4 is a sample relational database table corresponding to "Exchange" 

data in accordance with the present invention; 

[0017] Fig. 5 is a flowchart of an exemplary method for calculating a measure 

expression in accordance with the present invention; 
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[0018] Fig, 6 is an exemplary first cache in accordance with the present invention; 

[0019] Fig. 7 is an exemplary first index in accordance with the present invention; 

[0020] Fig. 8 is an exemplary second cache in accordance with the present 

invention; and 

[0021] Fig, 9 is an exemplary calculation of a measure expression in accordance 

with the present invention. 

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 
[0022] Systems and methods that meet the above-mentioned objects and provide 

other beneficial features in accordance with the presently preferred exemplary 
embodiments of the invention will be described below with reference to aforementioned 
Figures. Those skilled in the art will readily appreciate that the description given herein 
with respect to those figures is for explanatory purposes only and is not intended in any 
way to limit the scope of the invention. Throughout the description, like reference 
numerals will refer to like elements in the respective figures. 

Computer Environment 

[0023] Fig. 1 and the following discussion are intended to provide a brief general 

description of a suitable computing environment in which the present invention and/or 
portions thereof may be implemented. Although not required, the invention is described in 
the general context of computer-executable instructions, such as program modules, being 
executed by a computer, such as a client workstation or a server. Generally, program 
modules include routines, programs, objects, components, data structures and the like that 
perform particular tasks or implement particular abstract data types. Moreover, it should 
be appreciated that the invention and/or portions thereof may be practiced with other 
computer system configurations, including hand-held devices, multi-processor systems, 
microprocessor-based or programmable consumer electronics, network PCs, 
minicomputers, mainframe computers and the like. The invention may also be practiced 
in distributed computing environments where tasks are performed by remote processing 
devices that are linked through a communications network. In a distributed computing 
environment, program modules may be located in both local and remote memory storage 
devices. 
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[0024] As shown in Fig. 1, an exemplary general purpose computing system 

includes a conventional personal computer 120 or the like, including a processing unit 
121, a system memory 122, and a system bus 123 that couples various system components 
including the system memory to the processing unit 121. The system bus 123 may be any 
of several types of bus structures including a memory bus or memory controller, a 
peripheral bus, and a local bus using any of a variety of bus architectures. The system 
memory includes read-only memory (ROM) 124 and random access memory (RAM) 125. 
A basic input/output system 126 (BIOS), containing the basic routines that help to transfer 
information between elements within the personal computer 120, such as during start-up, 
is stored in ROM 124. 

[0025] The personal computer 120 may further include a hard disk drive 127 for 

reading from and writing to a hard disk (not shown), a magnetic disk drive 128 for reading 
from or writing to a removable magnetic disk 129, and an optical disk drive 130 for 
reading from or writing to a removable optical disk 131 such as a CD-ROM or other 
optical media. The hard disk drive 127, magnetic disk drive 128, and optical disk drive 
130 are connected to the system bus 123 by a hard disk drive interface 132, a magnetic 
disk drive interface 133, and an optical drive interface 134, respectively. The drives and 
their associated computer-readable media provide non-volatile storage of computer 
readable instructions, data structures, program modules and other data for the personal 
computer 120. 

[0026] Although the exemplary environment described herein employs a hard disk, 

a removable magnetic disk 129, and a removable optical disk 131, it should be appreciated 
that other types of computer readable media which can store data that is accessible by a 
computer may also be used in the exemplary operating environment. Such other types of 
media include a magnetic cassette, a flash memory card, a digital video disk, a Bernoulli 
cartridge, a random access memory (RAM), a read-only memory (ROM), and the like. 
[0027] A number of program modules may be stored on the hard disk, magnetic 

disk 129, optical disk 131, ROM 124 or RAM 125, including an operating system 135, one 
or more application 212 programs 136, other program modules 137 and program data 138. 
A user may enter commands and information into the personal computer 120 through 
input devices such as a keyboard 140 and pointing device 142 such as a mouse. Other 
input devices (not shown) may include a microphone, joystick, game pad, satellite disk, 
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scanner, or the like. These and other input devices are often connected to the processing 
unit 121 through a serial port interface 146 that is coupled to the system bus, but may be 
connected by other interfaces, such as a parallel port, game port, or universal serial bus 
(USB). A monitor 147 or other type of display device is also connected to the system bus 
123 via an interface, such as a video adapter 148. In addition to the monitor 147, a 
personal computer typically includes other peripheral output devices (not shown), such as 
speakers and printers. The exemplary system of Fig. 1 also includes a host adapter 155, a 
Small Computer System Interface (SCSI) bus 156, and an external storage device 162 
connected to the SCSI bus 156. 

[0028] The personal computer 120 may operate in a networked environment using 

logical connections to one or more remote computers, such as a remote computer 149. 
The remote computer 149 may be another personal computer, a server, a router, a network 
PC, a peer device or other common network node, and typically includes many or all of 
the elements described above relative to the personal computer 120, although only a 
memory storage device 150 has been illustrated in Fig. 1. The logical connections 
depicted in Fig. 1 include a local area network (LAN) 151 and a wide area network 
(WAN) 1 52. Such networking environments are commonplace in offices, enterprise-wide 
computer networks, intranets, and the Internet. 

[0029] When used in a LAN networking environment, the personal computer 120 

is connected to the LAN 151 through a network interface or adapter 153. When used in a 
WAN networking environment, the personal computer 120 typically includes a modem 
154 or other means for establishing communications over the wide area network 152, such 
as the Internet. The modem 154, which may be internal or external, is connected to the 
system bus 123 via the serial port interface 146. In a networked environment, program 
modules depicted relative to the personal computer 120, or portions thereof, may be stored 
in the remote memory storage device. It will be appreciated that the network connections 
shown are exemplary and other means of establishing a communications link between the 
computers may be used. 

Exemplary Systems and Methods of the Present Invention 

[0030J An exemplary system for analytically modeling data in accordance with the 

present invention is shown in Fig. 2. As shown, an analytical data service 220 such as, for 
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example, an On-Line Analytical Processing (OLAP) service may be employed to model 
data stored in a relational database 200 such as, for example, an On-Line Transactional 
Database (OLTP). Analytical data service 220 may present analytically modeled data via 
reporting client 230. As set forth previously, data stored in relational database 200 may be 
organized according to multiple tables, with each table including data corresponding to a 
particular data type. A table corresponding to a particular data type may be organized 
according to columns corresponding to data attributes and measures. 
[0031] One such table is shown in Fig. 3, with a "Sales" data type organized in a 

"Sales" table 300 with columns "Customer ID" 310 , "Country" 320, "Month" 330, and 
"Sales Amount" 340. "Customer ID" column 310 , "Country" column 320, and "Month" 
column 330 are attribute columns that provide the attributes of the "Sales" data type at the 
finest granularity for which data is available. Specifically, "Customer ID" 310 is the finest 
granularity for a customer attribute of the "Sales" data type; "Country" 320 is the finest 
granularity for a location attribute of the "Sales" data type; and "Month" 310 is preferably 
the finest granularity for a time attribute of the "Sales" data type. "Sales Amount" column 
340 is a measure column that includes data for the measure "Sales Amount" according to 
customers, countries, and months in "Customer ID" column 310, "Country" column 320, 
and "Month" column 330, respectively. "Sales Amount" 340 is the amount of sales made 
in source currencies of corresponding countries. 

[0032] Referring now to Fig. 4, an "Exchange" data type is organized in an 

"Exchange" table 400 with columns "Country" 410, "Month" 420, and "Exchange Rate" 
430. "Country" column 410 and "Month" column 420 are attribute columns that provide 
the attributes of the "Exchange" data type at the finest granularity for which data is 
available. Specifically, "Country" 410 is the finest granularity for a location attribute of 
the "Exchange" data type; and "Month" 420 is the finest granularity for a time attribute of 
the "Exchange" data type. "Exchange Rate" column 430 is a measure column that 
includes data for the measure "Exchange Rate" according to countries and months in 
"Country" column 410 and "Month" column 420, respectively. "Exchange Rate" 340 is 
the rate of converting a source currency into US currency. 

[0033] Analytical data service 220 in accordance with the present invention 

enables a measure expression to be defined for a plurality of measures such as, for 
example, "Sales Amount" measure 340 from "Sales" table 300 and "Exchange Rate" 
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measure 430 from "exchange" table 400. The measure expression includes a relationship 
between the plurality of measures defined by an arithmetic operation. For example, a 
measure expression "US Sales" may be defined as the product of the measure "Sales 
Amount" 340 and the measure "Exchange Rate" 430 as shown below: 

(US Sales)ME = (Sales Amount^ * (Exchange Rate)M 

The measures included in the measure expression need not necessarily measure data types 
with common attributes. For example, the customer attribute of the "Sales" data type is 
not an attribute of the "Exchange" data type. 

[0034] Analytical data service 220 may receive and process a query requesting a 

calculation of a measure expression for a selected range of attributes. An exemplary 
method for processing such a query in accordance with an embodiment of the present 
invention is set forth below with reference to Fig. 5. Although a measure expression may 
include a relationship between any number of measures, the method set forth below is 
described with reference to a measure expression including two measures. Importantly, 
however, the method set forth below is not limited to a measure expression including two 
measures. The method may be applied to a measure expression including more than two 
measures by propagating the processing steps to the additional measures in the expression. 
[0035] Generally, data within a selected range of attributes for measures in the 

expression may be retrieved from database 200 and cached. Importantly, because the 
measure expression is defined as a relationship between measures, all rows comprising 
data within the selected range of attributes for each measure in the expression may be 
retrieved and cached through a single access to a corresponding data table. For example, 
to calculate "US Sales", all rows comprising data within the selected range of attributes for 
"Sales Amount" measure 340 may be retrieved and cached through a single access to 
"Sales" table 300, while all rows comprising data within the selected range of attributes 
for "Exchange Rate" measure 430 may be retrieved and cached through a single access to 
"Exchange" table 400. Such a single access to retrieve multiple rows of data is preferable 
to multiple accesses to retrieve data on a row by row basis. Reducing the number accesses 
to data tables may significantly reduce the amount of time required to process the request. 
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[0036] Referring now to Fig. 5, at step 510, a measure expression is defined at 

analytical data service 220. The measure expression may be defined as an arithmetic 
operation performed on a first measure and a second measure. For example, the "US 
sales" measure set forth above is defined as the product of a first measure "Sales Amount" 
340 and a second measure "Exchange Rate" 430. 

[0037] At step 512, a query is received at analytical data service 220 requesting a 

calculation of the measure expression for a selected range of attributes. For example, a 
query may request a calculation of "US Sales" for Mexican locations for customer "1" for 
a first quarter (January, February, and March). 

[0038] At step 514, a first cache is retrieved from a first data table in database 200 

corresponding to the first data type. The first cache includes data for the first measure 
over the selected range of attributes. An exemplary first cache 600 corresponding to the 
measure expression "US Sales" is shown in Fig. 6. First cache 600 is retrieved from 
"Sales" table 300 and includes "Customer ID" column 610, "Country" column 620, 
"Month" column 630, and "Sales Amount" column 640. "Customer ID" column 610 
includes entries for Customer "1" only, which comprises the selected range of customer 
attributes. "Country" column 620 includes entries for Mexico only, which comprises the 
selected range of location attributes. "Month" column 630 includes entries for January, 
February, and March only, which comprise the selected range of time attributes. "Sales 
Amount" column 640 is a measure column that includes data for the measure "Sales 
Amount" 340 according to customers, countries, and months in "Customer ID" column 
610, "Country" column 620, and "Month" column 630, respectively. Importantly, all rows 
in first cache 600 may be compiled through only a single access to "Sales" table 300. 
[0039] At step 516, a first index is generated from the first cache. The first index 

includes data for the first measure over the selected range of attributes common to the first 
data type and the second data type. Consequently, in the first index, data for the first 
measure is aggregated by attributes specific to the first data type. An exemplary first 
index 700 corresponding to the measure expression "US Sales" is shown in Fig. 7. First 
index 700 includes "Country" column 710, "Month" column 720, and "Sales Amount" 
column 730. "Country" column 710 and "Month" column 720 are attribute columns 
corresponding to attributes common to both the "Sales" data type and the "Exchange" data 
type. As should be appreciated, a "Customer ID" column 610 is included in first cache 
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600 but is not included in first index 700 as it is specific to the "Sales" data type. "Sales 
Amount" column 730 is a measure column that includes data for the measure "Sales 
Amount" 340 according to countries and months in "Country" column 710 and "Month" 
column 720, respectively. 

[0040] Data in "Sales Amount" column 730 is aggregated to include sales for 

Customer "1" only, which comprises the selected range of customers. Specifically, as 
shown in "Sales" table 300, 3.00 pesos of sales were made in Mexico in January, 1.00 
peso of which were made to Customer "1" and 2.00 pesos of which were made to 
Customer "2". Consequently, first index 700 shows that only 1.00 peso of sales were 
made in Mexico in January, to reflect the 1.00 peso of sales made to Customer "1" and not 
reflect the 2.00 pesos of sales made to Customer "2", as Customer "2" is not within the 
selected range of customers. 

[0041] At step 518, a second cache is retrieved from a second data table in 

database 200 corresponding to the second data type. The second cache includes data for 
the second measure over the selected range of attributes. An exemplary second cache 800 
corresponding to the measure expression "US Sales" is shown in Fig. 8. Second cache 
800 is retrieved from "Exchange" table 400 and includes "Country" column 810, "Month" 
column 820, and "Exchange Rate" column 830. "Country" column 810 includes entries 
for Mexico only, which comprises the selected range of location attributes. "Month" 
column 820 includes entries for January, February, and March only, which comprise the 
selected range of time attributes. As should be appreciated, second cache 800 does not 
include a "Customer ID" column because customer data is not applicable to the 
"Exchange" data type. "Exchange Rate" column 830 is a measure column that includes 
data for the measure "Exchange Rate" 430 according to countries and months in 
"Country" column 810 and "Month" column 820, respectively. Importantly, all rows in 
second cache 800 may be compiled through only a single access to "Exchange" table 400. 
[0042] At step 520, a second index is generated from the second cache. The 

second index includes data for the second measure over the selected range of attributes 
common to the first data type and the second data type. Consequently, in the second 
index, data for the second measure is aggregated by attributes specific to the second 
measure. For "US Sales", because the second cache 800 includes only the location and 
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time attributes common to both the "Sales" data type and the "Exchange" data type, the 
second index is identical to second cache 800. 

[0043] At step 522, the arithmetic operation defined in the measure expression is 

performed on the data for the first measure from the first index and the data for the second 
measure from the second index. The operation is performed on a row by row basis, 
whereby each measure expression entry is calculated based on the first measure entry and 
the second measure entry from the corresponding row. 

[0044] An exemplary measure expression calculation 900 corresponding to the 

measure expression "US Sales" is shown in Fig 9. Calculation 900 includes "Country" 
column 910, "Month" column 920, "Sales Amount" column 930, "Exchange Rate" 
column 940, and "US Sales" column 950. "Country" column 910 and "Month" column 
920 are attribute columns corresponding to attributes common to both the "Sales" data 
type and the "Exchange" data type. "Sales Amount" column 730 and "Exchange Rate" 
column 940 are measure columns that include aggregated data for the measures "Sales 
Amount" 340 and "Exchange Rate" 430 from first index 700 and second index 800, 
respectively. 

[0045] "US Sales" column 950 is a measure expression column that includes data 

for the measure expression "US Sales" according to countries and locations in "Country" 
columns 910 and "Month" column 920, respectively. Entries in "US Sales" 950 are 
calculated on a month by month basis as the product of the entries in "Sales Amount" 
column 930 and "Exchange Rate" column 940 for the corresponding month. 
[0046] At step 524, the total measure expression for the selected range of attributes 

is calculated by aggregating the resulting measure expression entries for each row 
calculated at step 522. As shown in calculation 900, the total measure expression "US 
Sales" for the first quarter, "$14.50", is calculated by summing the individual measure 
expressions entries for each month in the first quarter. 

[0047] While the present invention has been described in connection with the 

preferred embodiments of the various figures, it is to be understood that other similar 
embodiments may be used or modifications and additions may be made to the described 
embodiment for performing the same function of the present invention without deviating 
therefrom. Therefore, the present invention should not be limited to any single 
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embodiment, but rather should be construed in breadth and scope in accordance with the 
appended claims. 
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